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We develop a relativistic velocity space called rapidity space from the single as- 
sumption of Lorentz invariance, and use it to visualize and calculate effects resulting 
from the successive application of non-colinear Lorentz boosts. In particular, we 
show how rapidity space provides a geometric approach to Wigner rotation and 
Thomas precession in the same way that spacetime provides a geometrical approach 
to kinematic effects in special relativity. 



The most commonly used Lorentz transformation relates an inert ial frame S to an inertial 
frame S' moving with a velocity v along the x-axis of S. If this Lorentz transformation also 
preserves the orientation of the spatial axes of S and leaves the sign of its time component 
unchanged (as is usually the case), then it is called a boost. In some cases an inertial frame S' 
is obtained from an inertial frame S by two successive boosts. If the two successive boosts are 
non-colinear then, contrary to what one might expect, the single Lorentz transformation that 
is their resultant is not a pure boost but rather is the product of a boost and a rotation. 
The unexpected rotation was discovered by Thomas^ in 1926, and derived thirteen years 
later by Wigner— in his seminal article on representations of the Lorentz group. If successive 
non-colinear boosts return the spatial origin of S' to the spatial origin of S, then all of 
the Thomas- Wigner rotations along the way combine to produce a net rotation of S' with 
respect to S called the Thomas precession^'^'^'^^ 

Thomas precession is an essential part of quantum courses discussing relativistic correc- 
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tions to the Hamiltonian of a hydrogen atom because it changes the non-relativistic form of 
the spin-orbit term by a factor of one-half. Rather than derive this result, however, some 
texts state it without giving any references,'^ some state it and reference only Thomas' orig- 
inal article,-^ while others state it and appeal to the Dirac equation for its justification . ^^'^^ 
The few texts and journal articles which do derive Thomas precession often use mathemat- 
ics that is somewhat sophisticated,— such as infinitesimal generators of the Lorentz group,— 
"a weakly associative-commutative groupoid,"— "gyrogroups and gyrovector spaces,"^- the 
Gibbs method for adding finite rotations,— holonomy group transformations and Clifford- 
Dirac algebra,-^ the tetrad formalism,^^^ Fermi- Walker transport or unboosted Fermi- 
Walker frames,-^. Although several texts do present straightforward algebraic derivations, 
they are long and somewhat tedious.-^*^ Recalling that many relativistic effects are easier to 
derive and understand when treated geometrically in spacetime, we wondered if there was 
a relativistic velocity space in which Thomas precession (and other effects involving succes- 
sive non-colinear boosts) could be treated geometrically and in this way also made easier to 
understand. 

Perhaps the most intriguing approach to constructing a relativistic velocity space was 
mentioned in the 1950's by Landau and Lifshitz.^^ They begin an exercise for the reader by 
noting that given two non-colinear relativistic velocities v and v + dv, the relative velocity 
dv can be considered as a line element in a three-dimensional velocity space in which each 
point is specified by the azimuthal and polar angles of v and a radial coordinate equal to a 
function of v called the rapidity. Landau and Lifshitz then ask the reader to show that this 
relativistic velocity space is non-Euclidean, with a hyperbolic geometry. 

Landau and Lifshitz do not reference the origin of this exercise, so it is not clear if they 
discovered the velocity space themselves or are drawing upon work published previously in 
the Russian literature. Both Pauli^^ and Rosenfeld^^ credit a paper written in Russian by 
the Croatian mathematician Varigak as the first place in which relativistic velocity addition 
was related to the analog of vector addition in a hyperbolic space. Pauli cites four addi- 
tional articles (also written in Russian) by Varigak, published between 1910 and 1919, and 
Rosenfeld notes that Varigak summarized and expanded upon his work in a book (written 
in Russian) published in 1924. Rosenfeld cites only two references to work on this subject 
that appeared after 1924, both of which are books written in Russian. One was published 
in 1963 and the other in 1965. 



3 



Given this history, it seems hkely that Landau and Lifshitz's text was the first written 
in Enghsh to mention a relativistic velocity space with hyperbohc geometry. Indeed, in 
1997, when Aravinci^i showed how the Thomas- Wigner rotation and Thomas precession had 
properties identical to those of areas in a hyperbolic space, he credited this discovery to "the 
crucial hint . . . from Landau and Lifshitz. . . ." More recently, Criado and Alamo2S chose a 
hyperboloid in spacetime to represent a relativistic velocity space, which they mapped onto 
a unit disk with hyperbolic geometry (called the Poincare disk). They then drew on results 
from non-Euclidean geometry, such as the law of cosines and the equations of geodesies in 
a hyperbolic space, to show how certain properties of hyperbolic triangles correspond to 
certain properties of relativistic velocities and velocity addition. 



The interesting results in Refs. |2J and 12^ are not readily accessible to many physicists 
because they assume a familiarity with formulas and theorems from non-Euclidean geome- 
try. Furthermore, although these articles make the connection between relativistic velocity 
addition and hyperbolic geometry compelling, neither explains this connection nor develops 
it systematically from first principles. 

The purpose of this paper is to derive a relativistic velocity space (called rapidity space) 
from first principles, and to demonstrate how it provides a geometric approach to solv- 
ing problems involving the relativistic addition of non-colinear velocities and successive, 
non-colinear, Lorentz boosts. The development is self-contained and assumes no previous 
knowledge of hyperbolic geometry. Beginning with the single requirement of Lorentz invari- 
ance, we construct rapidity space using an approach that parallels the one used to estabhsh 
spacetime. We find that just as many kinematic effects in special relativity are more easily 
and elegantly understood once the spacetime metric 



is established (in a spacetime with two spatial and one time dimension), so too are many 
aspects of the addition of non-colinear boosts more easily understood once the rapidity space 
metric 



is established (with x and y related to the usual components of velocity, as defined in 
Sec. E)). In particular, once the main properties of rapidity space have been developed, 
exact expressions for the Thomas- Wigner rotation and Thomas precession can be found 




ds^ = dx^ + dy"^ - c^dt^ 



(1) 




(2) 
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geometrically. Furthermore, working in rapidity space allows various qualitative aspects of 
these effects to be deduced geometrically, some of which are more difficult to prove with 
algebraic equations alone. Indeed, we have found relativistic velocity (rapidity) space to be 
as useful for understanding the relativistic addition of non-colinear velocities and Lorentz 
boosts as spacetime has been for understanding kinematic effects in special relativity. 

As mentioned above, this paper derives a 2D relativistic velocity space (called rapidity 
space) from the single assumption of Lorentz invariance. Although developing the material in 
this way is logically satisfying, and has the additional benefit of unifying (and occassionally 
correcting) results presented previously, it does make the paper rather long. Those who 
would rather bypass the derivations and proofs, and simply accept that there is a relativistic 
velocity space whose metric is given by Eq. ()49cj) and whose geodesies are the ones described 
at the end of Sec. IVIl can proceed directly to the applications presented in Sec. I VI II Sections 
im through IVII present the proofs and derivations needed to establish the relativistic velocity 
space and its properties, while Sec. lVIIll is included for those interested in how some of what 
is presented here is expressed using group theory, quaternions, spinors, etc. 



Consider two inertial frames S and S' whose origins are coincident when t = t' = 0, and 
whose X and aligned. (Two inertial frames related in this way are said to be in the 

standard configuration, and it is easy to show that the linearity of Lorentz transformations 
always makes this choice possible for any two inertial frames 5* and S".— ) As mentioned in 
the Introduction, a non-trivial Lorentz transformation from S to an inertial frame S' moving 
with a velocity v with respect to S is called a boost if it preserves the orientation of the 
spatial axes and leaves the sign of the time component unchanged. If the boost is in the 
x-direction, then the transformation equations are 



II. NOTATION AND BACKGROUND 




(3a) 
(3b) 
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with 7 = 1/ a/1 — j(? . In what follows, we restrict ourselves to spacetimes with one time 
and two spatial dimensions because this is sufficient for understanding the most common 
cases of Thomas rotation and precession.— 

For convenience, let X\ = x, X2 = X3 = ct, and /? = vjc. Using this notation, 
7 = (1 — and Eq. Q becomes 



J 



X2 



(4) 



7 -7/3 
1 
y-7/5 7 

If we let X represent the column matrix on the right-hand side of Eq. (^, then the length 
(norm) squared of x can be expressed as 

/l \ 



_|_ ^3 ^2 rn'^ 



1 



X = X Gx. 



(5) 



VO -Ij 

More generally, any linear transformation A is a Lorentz transformation if and only if it 
leaves the spacetime metric 

x'^Gx' = x^Gx (6) 
invariant for all x, or equivalently, if and only if 

A^GA = G. (7) 



III. RELATIVISTIC VELOCITY ADDITION AND THE RAPIDITY 



The rapidity of a boost (3 is defined as 



Thus, 



(J) = arctanh f3. 

P = tanh0, 
7 = cosh0, 
7/9 = sinh0. 



(8) 

(9a) 
(9b) 
(9c) 
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Using the rapidity allows Lorentz boosts to be expressed in two alternative and interesting 
ways. In the first, if we substitute the rapidity into Eq. (0)), we obtain 

cosh (f) — sinh 











X2 



(10) 



V^s/ \~ si^h cosh J \x3 J 

which illustrates that the rapidity can be interpreted as an imaginary rotation angle in 
spacetime. 

A second way of expressing Lorentz boosts is found by introducing the new coordinates 
(e,r/), with2^ 

^ = X3 + Xi and 1] = X3 — Xi. (11) 
If we use these coordinates, Eq. (fTUj) can be expressed in the simple form 



V 



(12a) 
(12b) 



For future reference, note that Eq. (fT^ also can be written in the form 

e' = (7-7/5)e = 

= (7 + iP) V = 



l + p 

'l + (3 
1-/3 



(13a) 
(13b) 



The ^ and rj coordinate axes lie on the light cone (a/^ 



±ct) and transform into 



themselves under this type of Lorentz boost. Expressed in another way, these axes are 
eigenvectors of the boost in Eq. (fTUI) with real eigenvalues e^"^ which, as can be seen from 
Eq. (fT^ and (fT^ . are simply the blue- and redshift factors in the relativistic Doppler 
effect.29: 

The rapidity is most commonly used to simplify the addition of colinear relativistic ve- 
locities. As is well known, the relativistic addition of two colinear velocities Vi and V2 gives 
a resultant (colinear) velocity v with magnitude 



Vi + V2 



1 + {viV2/c^) 



P1+P2 
1 + Plf32 



(14) 
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The correct generalization of Eq. p4|) to the relativistic addition of n cohnear velocities is 
not obvious. However, if we reexpress Eq. ()14|) using the rapidity, we find that 



Using the identity 



we have 



arctanh/3 = arctanh ( ] ■ (1^) 

1 + P1P2 , 



arctanh a = - In f — — — | , (16) 



1 (l + A)(l+/?2) ..7. 



Thus, 



= arctanh /5i + arctanh (18b) 
= 01 + 02. (18c) 

If we express the sum of two colinear velocities in the form of of Eq. (|18|) rather than in 
the usual form of Eq. (HH), the relativistic sum of n colinear velocities Pi, P2, ■ ■ ■ , Pn is easily 
shown to have magnitude /3, with 

(f) = arctanh /3 (19a) 

^ 1 (l + A)(l + /32)...(l+/30 

2 (l-/3i)(l-/32)...(l-/5„) ^ ^ 

= arctanh Pi + ■ ■ ■ + arctanh Pn (19d) 

= 01 + 02 + . . . + 0n. (19e) 

Thus, the rapidity provides an easy way to express the relativistic sum of n colinear velocities 
when n > 2. 

Note that using the Lorentz transformation Eq. (fT^ allows us to express Eq. ()19|) in 
another useful form.^° If we combine Eqs. ()12b|l and ()13b|) . we have 



1 + P 
1-/3' 



Using this relation in Eq. ()19ep we find that 



(20) 



(21) 
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which imphes that 



(22) 



\1-PJ \l-P2J \1-Pn, 

Equation ()22|) provides a surprisingly easy way to find the resultant (3 of the relativistic sum 



of n colinear boosts. 



IV. RAPIDITY SPACE 



In this section we derive a 2D relativistic velocity space, called rapidity space, directly 
from the 3D spacetime of special relativity (that is, from the spacetime with two spatial and 
one time dimension). If we use the coordinates Xi, X2, and X3 defined in Sec. HH the square 
of the line element for this spacetime is 

ds"^ = (dxi)^ + {dx2f - {dx^f. (23) 

We choose this particular form of ds^ because in the = plane, it reduces to the usual 
Euclidean relation 

ds| = {dxif + {dx2f. (24) 

Suppose we fix ourselves in one inertial frame and consider another with the same space- 
time origin but moving with a velocity v relative to the first. The spatial origin of this 
second frame appears to us as following a straight line {x{t),y{t)) = vt, where (f/c) < 1. 
Thus its trajectory is a line emanating from the origin and lying within the light cone. This 
line also can be described in Xj-coordinates as the one formed by all the scalar multiples of 
the vector (/?, 1), where (3 = v/c. If we turn this statement around, we can say that every 
straight line through the origin that lies within the light cone represents the trajectory of the 
origin of some inertial frame traveling with a velocity v relative to the fixed inertial frame 
represented by the spacetime. 

Another way of describing all the straight lines through the origin and within the light 
cone is to note that each can be viewed as the x'^ axis of some inertial frame obtained from 
the original (xi,a;2,a;3) frame by a unique boost. Because each Xg axis corresponds to one 
particular velocity (and vice versa), we can create a model of velocity space by choosing one 
point from each x'^ axis. The set of all such points is a velocity space because each point in 
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it represents a unique velocity, and because all velocities (3 with magnitude (3 < 1 will be 
represented. 

There are several natural ways to construct a relativistic velocity space from the set of 
points defined in the previous paragraph. For example, we could start with all the points 
lying in the plane X3 = 1 and inside the lightcone; alternatively, we could start with all the 
points lying on the hyperboloid with X3 > 0. 

As shown in Fig. the first choice is the simultaneity plane X3 = 1 for an observer in 
the inertial frame represented by our spacetime, while the second is the set of all points 
for which the proper time r = 1. The first choice results in a velocity space known as the 
Klein model, which is not the best choice for our purposes because angles in this model do 
not appear like Euclidean angles (that is, the Klein model is not conformal.^^) The reason 
we would like a conformal model is that the relativistic addition of non-colinear velocities, 
the Thomas- Wigner rotation and the Thomas precession, are built on understanding angles 
between successive boosts, so only those spaces in which angles behave like Euclidean angles 
can be expected to offer the geometric insight we seek. 

The second choice, of all the points that lie on the hyperboloid of revolution, does result 



25. This choice is both 



in a conformal velocity space and is in fact the one chosen in Ref. 
reasonable and convenient because the metric on this space and the geometric properties 
that follow from it are well-known to mathematicians. However, few of us are good at 
judging angles on a curved surface. 

So how do we motivate or justify choosing one model over another? Given that we are 
free to choose any surface created by any method of choosing one point from each X3 axis, 
why choose the hyperboloid? How do we know there isn't some other way of choosing a 
point from each that will lead to an even more convenient or appropriate model of 

velocity space? 

Rather than trying to justify one choice over another after the fact, we construct a model 
of relativistic velocity space from first principles by following the method used to derive 
spacetime and the spacetime metric. In that case, spacetime is developed from the physical 
requirement that the speed of light is independent of the motion of the source and is the 
same in all inertial frames. The mathematical statement of this property, = (ct)^, implies 
that 

+ 1/2 - {ctf = 0, (25) 
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and leads to interpreting the quantity 

x^ + y"^ - {cif (26) 

as the square of a distance in a 3D spacetime where Lorentz transformations are represented 
by hnear coordinate transformations A satisfying Eq. (0). Thus, rather that choosing a priori 
the nature of spacetime, a physical invariance is used to deduce a metric that determines its 
mathematical properties. 

We use this same approach to deduce the geometry of a relativistic velocity space. We 
note that Lorentz transformations acting on spacetime also act on the set of rays inside 
the light cone emanating from the origin, and that each of these rays has a one-to-one 
correspondence with a rapidity. Thus, the invariance of the spacetime metric Eq. ()23j) under 
Lorentz transformations can be used to define a metric on the rays (or rapidities) that also 
is Lorentz invariant. 

To find this metric, first note that because we are choosing the points in rapidity space to 
correspond to rays inside the light cone that emanate from the spacetime origin, the metric 
in rapidity space should be expressible in terms of the spacetime coordinates. That is, there 
should be functions fij such that the line element squared in rapidity space can be expressed 
in the form 

3 

ds^ = fij{xi, X2, X3)dxidxj. (27) 

However, because any two points on the same ray in spacetime specify the same rapidity, 
the line element in rapidity space must be the same regardless of which spacetime points on 
the ray we choose. Thus, for any A, we require 

ds'^{xi,X2,X3) = ds^{Xxi, Xx2, Xxs). (28) 

The spacetime form ds'^ = dx\ + dx\ — dxl does not have this property because 

ds'^{\xi, Ax2, Axa) = A^(is^(xi, X2, x^). (29) 

However, we can obtain a ds^ with the property given in Eq. ()28|) by using a simple but clever 
trick: first take the logarithm of both sides of Eq. (j^^ (which changes the multiplication by 
A^ into the addition of InA^), and then differentiate (so that the InA^ term disappears). 
More formally, the spacetime inner product, 

y) = -x-y = -xiyi - X2I/2 + x^y^, (30) 



11 



is positive for rays within the hght cone, and has the property 

q{Xix, A22/) = AiA2g(a;, y). (31) 
By taking the logarithm of both sides, we find 

In g(AiX, A2?/) = In Ai + In A2 + In q{x, y). (32) 
Finally, taking the differential of both sides respect to x and y, we obtain 

d^dy In q{XiX, A2I/) = d^dy In g(x, y). (33) 
In this way we are led to look for a rapidity space metric whose inner product has the form 

d^dy In (-a; ■ y) = d^dy[\n {x^y^ - xm - X2y2)] (34) 

Ixsdy^ - xidyi - X2dy2 

= flx 

L XsVs - xwi - X2y2 

= ■ yy^iidx ■ dy){x ■y)-{x- dy){y ■ dx)]. (36) 

Setting X = y we see that the line element squared in this relativistic velocity space should 
have the form 

ds'^ = K(x ■ [{dx ■ dx){x ■ x) — {x ■ dxY'\ , (37) 

where K is an arbitrary constant. 

Recall that even though the line element in Eq. (j37|) is expressed in terms of spacetime 
coordinates, it also is a function of the rays inside the light cone and through the origin on 
which those spacetime points lie. 

As mentioned above, in order to better visualize the geometry that follows from ds'^ in 
Eq. (IHTj) . we may choose as a model of relativistic velocity space any surface that intersects 
each ray in exactly one point. If such a surface is expressed as 

X3=g{xi,X2), (38) 

then by substituting this expression for X3 into Eq. p7jl . we can express the metric in terms 
of the two coordinates xi and X2. 

Although we have great freedom in selecting the function g, several judicious choices will 
greatly simplify our model. First, because Lorentz transformations include rotations, and 



(35) 
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because we seek a metric that is invariant under Lorentz transformations, it is natural to 



choose a surface that has rotational symmetry. Thus, we require that 



x^ = g (^a/(xi)2 + (x2)2j = g{r). 



(39) 



To find dx^ in terms of Xi and we apply the chain rule to Eq. (jH^ . giving 



(ixs = — ixidxi + X2dx2) , 
r 



with 



Because 



dr 



2 2 2 2 2 

we can rewrite Eq. as 

cis^ = K{g^ - r2)-2 [(da;^ - dxl - - r') 

— (xsdxs — XidXi — X2dX2Y ■ 
If we use Eqs. flS^ and pUj) in Eq. (jlSj) and do some algebra, we find that 



(40) 



(41) 



(42) 



(43) 



ds^' 



K 



J.2 _ g2 



{dxl + d'^l) '~ ^ 



1 _ ^ 



XidXi + X2dX2 

^2 _ g2 



(44) 



Although there is still much freedom in our choice of the surface g, we now impose our 
desire to have a model that is conformalj^ We note that a metric will be conformal if it is a 
multiple of the Euclidean metric, even if the multiplicative factor varies from point to point. 
Thus, Eq. (jl^ will be a conformal metric if cross terms like dxidx2 are not present. To this 
end, we look for a surface g for which 

2 



which implies that 



g'{g'r - 2g) + r = 0. 



(45a) 
(45b) 



One way to solve this first-order non-linear differential equation for g is to look for solu- 
tions of the form 

g = + Br + C. (46) 
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If we substitute thus form for g into Eq. ()45b|l . we find that g will be a solution if the 
coefficients A, B and C satisfy the conditions 

2AB = 0, (47a) 
B^ + 4AC = 1, (47b) 
2BC = 0. (47c) 

One set of coefficients that satisfies these conditions is A = C = and i? = 0, in 
which case^ 

-±(i^)- («) 

Therefore, we can use Eq. (^Hjl in Eq. and find that 

ds^ = (—^^—) (dxl + dxl) (49a) 
— g / 



which implies that 



/ 4a \ 

[ 2r^_,._^ Ydx\ + dxl) (49b) 



ds^ = ( -^y{dxl + dxl). (49c) 



Note that because we prefer distances in velocity space to be non-negative and real, we have 
chosen K = —1 in Eq. ()49c|) . 

The surface described by Eq. fHHjl is 



X3 = 9 = ^^ = ^ (50) 

which is a paraboloid of revolution about the xs axis with vertex at xi = 2:2 = and 
X3 = 1/2. The paraboloid also goes through points with r"^ = xf+x^ = 1 and X3 = 1, which 
are on the light cone. Not only does the paraboloid touch the light cone at r = 1, but the 
light cone is tangent to the paraboloid at this point because 



A(ct) 



Air) 



d 

r=i dr 



r=l 



= 1, (51) 

r=l 



which is exactly the slope of the light cone. 

As shown in Fig., each ray emanating from the origin and inside the light cone (with 
the exception of the xs-axis) intersects the paraboloid twice: once below the disk X3 = 1 
and once above it. Because we only need a single point from each ray to create a velocity 
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space, we only use that part of the paraboloid that is below the disk (that is, points on the 
paraboloid with 0:3 < 1), as shown in Fig.. 

Finally, note that the line element squared given in Eq. ()49c|l can be expressed in plane 
polar coordinates as 



V. VELOCITY SPACE, RAPIDITY SPACE AND THE POINCARE DISK 

Although the paraboloid of revolution derived in Sec. IIVI is a valid model of relativistic 
velocity space, in most cases it is much easier to work in the space obtained by projecting 
this paraboloid onto the (xi,X2) plane by the projection {xi,X2,X3) 1-^ {xi,X2) shown in 



The space created by this projection is a unit disk with the metric in Eq. (|49c|) . and is 
known to mathematicians as the Poincare disk. Although both the Poincare and hyperboloid 
models are conformal (the first in two dimensions and the second in three), the Poincare 
model is superior for building intuition about the Thomas- Wigner rotation and Thomas 
precession because it can be drawn in two dimensions, which makes line segments and 
angles easier to visualize. 

As we shall see, the distance from the origin to any point on the Poincare disk (as 
determined by the line element in Eq. ()49cj) ) is just the rapidity associated with that point, 
which is why we refer to this disk as rapidity space.— Because points on the edge of the 
disk are defined by the projection of points on the intersection of the paraboloid and the 
light cone, they represent velocities with speed v = c. We shall see that these points are 
an infinite (Poincare) distance away from any point inside the disk, reflecting the fact that 
speeds can approach but never reach the speed of light. 

To simplify the notation, we rename the coordinates on the disk x (= xi) and y (= X2). 
We identify the physical significance of the distance s of any point from the origin by 




(52) 



Fig.. 
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evaluating^^ 



ds= [ (-^) \/dx^ + dy^ (53a) 



^0 

R 



1 — r 



dr (53c) 







1 — 



In(l^), (53d) 



which imphes that 



s = 2 arctanh R. (54) 

However a point on the disk whose radial coordinate is i? = ^ ^W- corresponds to the 
spacetime point (a, 6, °^+^^+^ )^ which hes on the ray in spacetime with slope 

ct (l + i?2)/2 



which implies that 



R R 
c l + R^ 



(55) 

. 2R (^^) 

^ ^ 2tanh(s/2) 

^ l + i?2 1 + tanh(s/2)tanh(s/2) ^ ' 

/3 = tanh(s/2 + s/2) = tanhs. (58) 

We thus conclude that the (Poincare distance) s of any point from the origin is 

s = arctanh /3 = 0. (the rapidity) (59) 

We can further clarify the nature of rapidity space by relating its coordinates [x, y) to 
the Euclidean velocity components Vx and Vy. First, note that a point {x,y) on the disk 
corresponds to a point {x,y, ^-y-) on the paraboloid, which means that 



and 

Equations (jUUI) and ()61|) give (3 (and its components) in terms of any radial coordinate 
r (and its components). To find the inverse relation (that is, the r associated with a given 
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we first solve Eq. for tlie magnitude of r: 

pr"^ - 2r + (3 = (62) 



r^'-^^ (63, 

r = 2^. (64) 

(We use only the negative root in the quadratic formula because r < 1). We substitute 
Eq. in Eq. (plj) and use the identity, 

7' - 1 = = (7/?)', (65) 



to find 



^ = = (66a) 



2 /' ^ V7 + I 
1 + / 7 



^^=(7tTi^^' ^^^^^ 



and 
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From Eq. (jUUI) we see that the x coordinate in rapidity space is proportional to and 
the y coordinate is proportional to jSy. The proportionality factor in both cases is 7/(7 + 1), 
which goes to unity as f ^ c and to 1/2 as — 0. Thus, in the limit as f — 0, the line 
element squared in rapidity space (Eq. ()49c|) ) reduces to 

ds^ = {d[3,f + {d^yf, (68) 

which, of course, is the line element squared of the usual Euclidean non-relativistic velocity 
space. 

To summarize, we have shown that requiring the velocity space metric to be invariant 
under Lorentz transformations leads to a model of relativistic velocity space that can be 
represented either as a paraboloid of revolution with its vertex at Xi = X2 = 0, X3 = | and 
top edge at X3 = 1, or as a unit disk with the metric given in Eq. ()49cj) . In this paper we refer 
to this (Poincare) disk as rapidity space because the distance from the origin to any point 
on the disk is its rapidity. Note that from the definition of rapidity, it is easy to see that 
any point on the edge of the disk is infinitely far from any point in the disk. Also, any point 
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in rapidity space is related to the components of any velocity by Eqs. (lUUj) and (jMj) . The 
conformal property of rapidity space can be seen explicitly by noting that if arctan (fy/f^.) 
is the angle made by a velocity vector v with the horizontal axis in real space, then, using 
Eq. (jnni), we have 



which means that the angle of (3 in real space is the same as the angle 6 in rapidity space. 
This property of rapidity space is what makes it so useful for understanding the Thomas- 
Wigner rotation and Thomas precession. 



We need to understand one more aspect of rapidity space before we can use it to inves- 
tigate the relativistic addition of non-colinear velocities and boosts. When we boost from 
one inertial frame traveling with a velocity Vi to another traveling with a velocity V2, we 
pass through the minimum number of velocities whose speeds are greater than vi and less 
than t>2. The corresponding path in rapidity space between the points representing these 
velocities is the shortest one which, by definition, is the geodesic connecting them. Hence, 
to study successive non-colinear boosts, we need to identify the geodesies in rapidity space. 
We shall see that these geodesies are straight lines if the origin (the point representing zero 
velocity) is one of the values taken on during the boost. In all other cases the geodesies are 
not straight lines but rather are the arcs of circles. Following these arcs from one point in 
rapidity space to another will bring out most of the interesting features of relativistic velocity 
addition, successive Lorentz boosts, Thomas- Wigner rotations and Thomas precession. 

We begin by showing that any geodesic that includes the origin of rapidity space is a 
straight line. Our proof parallels the traditional one showing that the shortest distance 
between two points in a Euclidean plane is a straight line, and is accomplished by finding 
the path of minimum distance connecting two points on the disk. The length of any path 
connecting the origin and a point {A, B) in rapidity space is 



arctan — = arctan — = 6 



(69) 



VI. GEODESICS IN RAPIDITY SPACE 




(70b) 



(70a) 
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with R = VA^TW, and 



e' 



de 

dr 



(71) 



If we use the Euler-Lagrange equation, we know that the integral in Eq. ()70b|) will be an 
extremum when 



d 



dd LVl 



1 + r2^'2 



d d 
d^dO' 



1 



0. 



The first term on the left-hand side is zero, so Eq. (f?^ reduces to 



d^d_ 



0, 



which implies that 



and 



d 
dr 



f r^9'^ 



h{e'). 



(72) 

(73) 
(74) 
(75) 



If we square both sides of Eq. ()75|) and rearrange the terms, we find h? {Ar^ + Br^ +Cr'^ + 1) = 
4r^6''^, with A, B, and C functions of 6' . In order for this equation to be satisfied for all 
r, including r = 0, h{6') must be zero. Hence, from Eq. (f7H|l we see that 6' also must be 
zero, and thus 6' is a constant whenever r = lies on the path. Therefore, any geodesic in 
rapidity space that includes the origin is a straight line, as shown in Fig.. 

Next we derive the geodesies in rapidity space that do not include the origin. Because 
the rapidity space metric is Lorentz invariant by construction, Lorentz transformations must 
send geodesies to geodesies. Thus, we can obtain a geodesic that does not include the origin 
by applying the same boost to every point on a geodesic that does include the origin. Rather 
than doing this directly, it is easier to obtain the final geodesic geometrically by projecting 
back and forth between the disk and the paraboloid. In this approach, we first identify the 
geodesies on the paraboloid that correspond to geodesies through the origin of the disk. 
We then apply the same boost to every point on one of these geodesies on the paraboloid. 
Finally, we project the Lorentz transformed geodesic on the paraboloid onto the disk and 
find the equation that describes it. 

We already have proven that any geodesic that includes the origin of the disk is a straight 
line. If we project one of these straight lines back up to the paraboloid, we see that it 
corresponds to a parabola through the vertex of the paraboloid, as shown in Fig.. Thus, any 
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such (vertically oriented) parabola is a geodesic. Equivalently, any one of these (vertically 
oriented) parabolas can be regarded as the curve formed at the intersection of the paraboloid 
and the plane defined by two axes and Xg, each of which has a point on the paraboloids' 
geodesic. 

If we now perform the same pure boost on every frame represented by a point on a 
parabola passing through the vertex of the paraboloid, we obtain a new geodesic that does 
not include the vertex. Because boosts are linear transformations, planes through the origin 
transform into other planes through the origin. Thus, the new geodesic can be described 
as the curve created by the intersection of the paraboloid and the new plane formed by the 
boosted and clXGS, clS shown in Fig.. If we project this curve onto the disk, we can find 
the equation of an arbitrary geodesic (on the disk) that does not include the origin: 

Because the boosted x'^ and define the plane whose intersection with the 

paraboloid determines the new geodesic, this plane must lie within the light cone. It fol- 
lows that there is a normal to the plane making an angle 6 with the original X3 axis, with 
7r/2> 9> 7r/4: 

vr 1 

< cos 61 < cos - = (76) 

On the other hand, if we express the normal to the plane as (a, 6,1), then any point 
{xi,X2,X3) on the plane satisfies 

(a, 6, 1) ■ (xi,X2,X3) = 0, (77) 
or axi + bx2 + X3 = 0. However, Eq. (fTUj) tells us that 

cosO^^-^^^^m^<' (78) 

which implies that ^/ a"^ + b"^ + 1 > \pl and + 6^ > 1. We thus conclude that the plane 
shown in Fig. is specified by 

axi + 6x2 + X3 = with + 6^ > 1. (79) 

Because any such plane can be obtained by boosting the appropriate x'^ and Xg, we conclude 
that planes of the form in Eq. ()79j) determine all the geodesies on the paraboloid. That is, 
a curve is a geodesic if and only if it lies on the intersection of the paraboloid and any plane 
including the origin. 
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Our main interest is in the points shown in Fig. that he on the intersection of the plane 
and the paraboloid. Any point on the paraboloid satisfies Eq. (|^. 

xs = l±^i±^. (80) 

If we combine Eq. fl80|) with Eq. (f?^ . we see that points on the curve formed at the inter- 
section of the plane and the paraboloid (that is, points on a geodesic on the paraboloid that 
do not pass through its vertex) satisfy 

- axi - 6x2 = ^^^1^, (a2 + 62>i) (gi) 

which implies that 

(xi + of + (X2 + 6)^ = + 6^ - 1 > 0. (82) 

If we project back down onto the disk as shown in Fig., we see that any geodesic that does 
not include the origin is the arc of a circle centered at (—a, —h) with radius ^ ^W- — \. 
Note that the center of any of these circles always lies outside of the unit disk because 

+ 6^ > 1, and that any point (a, 6) outside of the unit disk is the center of a circle on 
which some geodesic inside the disk lies. Also, because a and h can be positive or negative, 
it doesn't really matter whether we denote the center of the circle by (—a, —6) or (a, 6). 

Finally, we can prove that the circles derived in Eq. ()82|) are perpendicular to the edge 
of the disk at their points of intersection. We recall from Sec. El that points on the disk are 
denoted by {x,y), points on the edge of the disk satisfy 

x' + y' = l, (83) 

and points on a geodesic on the disk satisfy 

{x + + {y + bf = a^ + b'^-l. (84) 

By taking the differential of Eq. (jHHj) and Eq. ()84|) . we obtain 

2xdx + 2ydy = (85) 
2(x + a)dx+2{y + h)dy = 0, (86) 

which can be rewritten as 



dy X 

dx y 

dy X + a 

dx y + b 



(87) 
(88) 
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Equation (|H7jl gives the slope of the tangent to any point on the edge of the disk, and Eq. 
gives the slope of the tangent to any point on the geodesic. To prove that the geodesic is 
perpendicular to the edge of the disk at their point of intersection, we must show that these 
two tangents are perpendicular to each other; that is, that 

= y±^. (89) 

y X + a 

To do this, we note that any point on both the edge of the disk and on the geodesic satisfies 
both Eqs. (|H^ and (|HH| . and by adding these two equations together we find 

+ 2ax + 2?/2 + 2by = (90) 
x{x + a) +y{y + b) = (91) 

-- = (92) 

y X + a 

as required. Note that this argument can be reversed to show that any circular arc orthogonal 
to the unit circle at their points of intersection is a geodesic. 

The main results of this section are the following. As shown in Figs, and, geodesies 
through the origin of rapidity space are straight lines (in the Euclidean sense), and any 
straight line through the rapidity space origin is a geodesic. 

As shown in Figs, and , geodesies in rapidity space which do not include the origin are 
arcs of circles whose centers lie outside the disk and are perpendicular to the edge of the 
disk at their points of intersection. Conversely, any point outside the disk is the center of 
some circular arc within the disk that is a geodesic. As Fig. shows, the new feature is that 
rapidity space contains geodesies that are not "straight" in the Euclidean sense. One way 
to understand these curved geodesies is to note that the metric of Eq. ()49c|) tells us that 
segments near the edge of the disk are much longer then they appear, so the shortest path 
between two points near the disks' edge must be bowed inward rather than straight. 



VII. APPLICATIONS 



Now that we have found the Lorentz invariant metric and geodesies in rapidity space, 
we can investigate the relativistic addition of velocities and the various consequences of 
successive non-colinear Lorentz boosts. 
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A. Qualitative Results 

Before deriving quantitative expressions for the Thomas- Wigner rotation and Thomas 
precession, we first discuss results that can be deduced geometrically without the use of any 
equations. 

Velocities are represented by points in rapidity space. A boost from one velocity to 
another is represented in rapidity space by the geodesic connecting them. Because a pure 
boost doesn't involve any rotation of the reference frame being boosted, the coordinate axes 
representing the boost in rapidity space maintain a fixed angle with respect to the geodesic 
they follow, as shown in Fig.. When coordinate axes maintain a fixed angle with a geodesic 
as they move along it from one point to another, they are said to undergo parallel transport. 
The most familiar example of parallel transport occurs in nonrelativistic velocity space, in 
which all the geodesies are straight lines. 

We begin by considering a set of colinear boosts in real space. If we assume this set 
contains the zero velocity frame, then the first boost is represented in rapidity space by a 
segment of a straight line geodesic that includes the rapidity space origin. Without any loss 
of generality, we take that direction as the horizontal axis in both real and rapidity space. 
Because straight lines through the origin of rapidity space are geodesies, each colinear boost 
is represented by a segment of the same (horizonal) line. If we represent a coordinate system 
in rapidity space by two small perpendicular lines (crosshairs) centered on the point of 
interest, then as shown in Fig., when we boost from one velocity to another, the orientation 
of the crosshairs remains fixed with respect to the geodesic connecting them. Thus, the 
orientation of the crosshairs is unchanged no matter how many colinear boosts it undergoes 
because, in each case, it is moving along the same straight line geodesic in rapidity space. 

Furthermore, because the distance (as measured with the rapidity space metric) from the 
origin to any point in rapidity space is the rapidity of that point, we see that when successive 
boosts are colinear, the corresponding rapidities add and subtract like ordinary numbers. 
Thus, rapidity space provides an easy geometrical way to obtain Eq. ()19ej) and to prove that 
frames boosted in the same direction do not rotate with respect to each other. Working in 
rapidity space also provides an easy proof that no matter how many colinear velocities are 
added together, the magnitude of their sum always will be less than the speed of light. 

A boost that does not include the zero velocity frame corresponds to a geodesic in rapidity 
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space that does not include the rapidity space origin. As we proved in Sec. I VII this geodesic 
hes on the arc of a circle whose center is outside the disk. As shown in Fig., crosshairs 
moving along this type of geodesic maintain their orientation with respect to it. Therefore, 
crosshairs moving back to the origin along a closed path that includes one (or more) of these 
geodesies will be rotated with respect to their initial orientation. An example of this rotation 
is shown in Fig.. Suppose we boost from rest to a velocity v along the x-axis (in real space). 
Then we perform a non-colinear boost from a frame with velocity vto a. frame with velocity 
v', and finally, we boost from a frame with velocity v' back to the original rest frame. If we 
look at the corresponding points in rapidity space as shown in Fig., we see that the frame 
obtained at the end of these three boosts is rotated with respect to the one that stayed at 
the origin. This rotation is the Thomas- Wigner rotation, which we denote by TWR, and 
the geometry of rapidity space shows that the TWR is in the clockwise (negative) direction 
when a frame is moving in rapidity space in the counterclockwise (positive) direction (and 
vice versa). 

It also is easy to see that there is an upper limit on the TWR angle. Without any loss of 
generality, suppose we first boost along the x-axis (in real space) to a frame whose speed is 
very close to the speed of light. There is no orientation change of the boosted frame because 
the geodesic it follows in rapidity space is a straight line. If we next perform a non-colinear 
boost to a speed even closer to the speed of light, which makes an angle slightly less than 
TT with the X-axis, then as shown in Fig., the geodesic representing this second boost will 
lie on the arc of a circle that is perpendicular to the edge of the disk (where f = c) at its 
two points of intersection. In the limit that both speeds approach the speed of light and the 
angle between the boosts approaches vr, the arc representing the second boost approaches a 
half circle. Thus, the change in orientation of a reference frame following the arc approaches 
vr. Consequently, any Thomas- Wigner rotation angle has an upper limit of vr, and the limit 
of TT is approached only when the two boosts involved have speeds very close to the speed of 
light and are almost opposite to each other. In all other cases the TWR angle will be less 
than TT. 

On the other hand, if we perform the same two non-colinear boosts as before, but now 
give each a nonrelativistic speed {(3 <^ 1), then as shown in Fig., even though the geodesic 
representing the second boost still lies on the arc of a circle, that arc is indistinguishable 
from a straight line because it is located near the rapidity space origin. Therefore, when the 
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boost speeds are nonrelativistic, there is essentially no rotation of the frame in following the 
two geodesies. 

Next consider a reference frame in real space undergoing circular motion in the counter- 
clockwise (positive) direction with a constant, nonrelativistic speed. Classically, this situa- 
tion is treated by representing it as the limiting case of a set of small, non-colinear boosts. 
That is, circular motion is approximated as motion along a polygon with an ever increasing 
number of sides. Because the boosts involved all have the same nonrelativistic speed, all 
the circular arcs representing them in rapidity space are essentially indistinguishable from 
straight lines. Hence, as long as the speed of the frame in circular motion is nonrelativistic, 
it undergoes essentially no change in orientation upon its return to the origin (as we would 
expect). 

Now suppose the reference frame undergoing circular motion has a constant speed that is 
relativistic. The geodesies in rapidity space that form the polygon are now small arcs that 
lie on circles whose intersection with the edge of the disk is orthogonal, as shown in Fig.. 
Consequently, the frame being boosted undergoes a definite change in orientation with each 
boost. Thus, when the reference frame returns to its starting point, it will have undergone 
a clockwise (negative) rotation with respect to its initial orientation. This sum of all the 
rotations experienced along the way is the Thomas precession. Furthermore, we see from 
the geometry of rapidity space that the amount of rotation will be a function of the speed of 
the circular motion (that is, the rapidity space distance from the rapidity space origin), and 
will increase without bound as this speed approaches the speed of light, as shown in Fig.. 

We also can use the geometry of rapidity space to show that non-colinear, relativistic 
boosts are also (in general) noncommutative. Suppose we first boost a reference frame from 
speed zero to a speed close to the speed of light, and then boost the frame through a second 
non-colinear velocity. From the rapidity space diagram in Fig., it is clear that we will end up 
at a completely different point if we do the same boosts in reverse order. However, if we look 
closely at the rapidity space diagram, we see that although the two resultant velocities are 
represented by different points in rapidity space, they both are the same (rapidity space) 
distance from the rapidity space origin. Consequently, they both have the same rapidity 
(and hence speed), but not the same direction. The result that the final speed resulting 
from two successive non-colinear boosts is independent of the order in which the boosts are 
applied is usually proved by a somewhat long algebraic calculation.^^ 
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Finally, it is easy to see that the sum of any number of non-colinear relativistic velocities 
always results in a velocity whose magnitude is less than the speed of light. Although obvious 
when viewed on a rapidity space diagram, the corresponding algebraic proof is somewhat 
complex. 

B. Relation of the Thomas- Wigner rotation to the rapidity space triangle 

Because angles in rapidity space behave exactly like angles in Euclidean space, it is 
relatively easy to quantify the arguments of Sec. lVIlXl As shown in Fig., the angles between 
the boosts are 03, ai, and a2- The straight line geodesic making an angle with the 
horizontal axis is called $3, and the other two geodesies are $1 and $2- The length (rapidity) 
of the segment of the geodesic $j representing each boost is denoted by (pi. We see in Fig. 
that when we boost from the origin along $1, there is no change in the orientation of the 
crosshairs. As shown in Fig., when we boost along $2, the x-axis of the crosshairs maintains 
its angle n — ai with respect to $2 because $2 is a geodesic. Finally, as shown in Fig., 
boosting back to the origin along $3, the x-axis of the crosshairs maintains its orientation 
of (tt — ai) — a2 with respect to the geodesic $3. Thus, as Fig. shows, when the coordinate 
system returns to the rapidity space origin, its x-axis will have rotated from its initial 
orientation in the clockwise (negative) direction by the Thomas- Wigner rotation angle 



Note that if the two rapidities 0i and 02 are small (that is, if the speeds they represent 
are nonrelativistic) then, as shown in Fig., the figure formed by the segments of the three 
geodesies $j is indistinguishable from a Euclidean triangle (because in this case ai+a2+«3 ~ 
tt) and, as expected, the TWR angle is essentially zero. On the other hand, if the two 
rapidities 0i and 02 are large, then the resulting TWR angle can approach the upper limit 
of TT, as discussed above. 

The absolute value of the right-hand side of Eq. is known to mathematicians as the 
"angular defect," because it is a measure of how much the sum of the angles inside a triangle 
differs from the corresponding sum in ordinary Euclidean space (vr). The theorem that the 
angular defect of a triangle is equal to its area is proven in hyperbolic geometry courses. 
Rather than simply invoking this result, we can establish it from first principles. Even if we 




(93) 
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had no previous knowledge of hyperbolic geometry, we might suspect that the area enclosed 
by the rapidity space triangle is proportional to the TWR angle, because our method for 
finding the TWR angle involves traveling around a closed path and summing up the angular 
change along the way. This sum corresponds to evaluating an integral of the form 

de. (94) 



c 

An integral like (jMj) appears in Green's theorem, which relates an integral around a closed 
curve to an integral over the two-dimensional area enclosed by that curve. We now show 
that the integral Eq. ()94|) is actually the left-hand side of Green's theorem for a particular 
choice of the integrand, and that this choice makes the right-hand side of Green's theorem 
equal to the area enclosed by the curve. 
We begin by writing Green's theorem as 



F-ds = jj ^ ^ F-hda, (95) 

C E 

where C signifies any closed (2D) curve traversed in the counterclockwise direction, n is the 
unit vector normal to the plane of this curve (according to the right-hand rule), S stands 
for the region enclosed by the curve, and F is any vector defined on rapidity space. 

To apply Green's theorem in rapidity space, we first must find explicit expressions for 
each integrand. To do this, recall that 

ds"^ = ^ (dx'^ + dy^^ = hi dx^ + dy^, (96) 



and 

From Eq. (jHZI), we find that the area element in Cartesian and plane polar coordinates is 



ds=( ^ )dxi+i " ^ )dyj. (97) 



da 



Al — r 



dx 



1 — r 



dy 



(98) 



2 \ 2 / 2 \ 2 

dxdy= rdrdO. (99) 



We also can give a more informal derivation of Eq. (j99|) . Because the metric Eq. (|97|) is 
conformal, it is locally a multiple of the Euclidean metric (even though this multiple varies 
from point to point). Therefore, we expect the area element also to be a multiple of the 
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Euclidean area element, with the multiplying factor equal to (2/(1 — t^))^, because the area 
is the product of the infinitesimal length in each of the two orthogonal directions and each 
length is the Euclidean length multiplied by the factor 2/(1 — r^). 

To evaluate the integral on the right-hand side of Eq. (jHSj), we need to express the curl 
and dot product in the coordinates of rapidity space. From Boas^i we have 

1 



V X F -n 



hih2 
1 - 



ox \ / ay 
dx 



d_ 

dy 



1 



(100a) 
(100b) 



Because Green's theorem is true for any vector F, it holds for the particular vector F 
—yi + xj. For this choice, the right-hand side of Eq. ()100b|) reduces to 1. 
Thus, when F = —yi + xj. Green's theorem becomes 



Area(S) = j F ■ ds, 
c 



(101) 



where the area enclosed by the closed curve C is calculated using the rapidity space area 
element Eq. ()99|). If we substitute our choice for F into the right-hand side of Eq. ()101|) . we 
find 



F -ds 



-yi + xj) 



1 - 



dxi + dyj 



c 



(102a) 
(102b) 



c 



Therefore, Eq. ()101|) can be written as 

Area(S) = 



[xdy — y dx). 



(103) 



c 



We now show that the right-hand side of Eq. p03|) is equal to an integral of the form 
given in Eq. (jMj) . Let r and 6 be the usual plane polar coordinates measured from the center 
of the disk. In terms of these coordinates, 



x = rcos6 and = —r sin 6'(i6' + rfr cos 6', 
y = r sin 6 and dy = r cos 6d6 + dr sin 6. 



(104a) 
(104b) 
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We substitute these coordinates into Eq. ()lU3p and find that 

xdy — ydx = r'^dO, (105) 



Area(S) = (t ( ) dO. (106) 



which imphes that 

^ref^.(y^ = J) I 

^1 — r 

c 

By looking at the hyperbohc triangle in Fig. representing the three boosts, we see that 
two of the three sides are straight lines emanating from the origin, which means that d9 = 
for these geodesic segments. Consequently, the integral along each of these segments makes 
no contribution to the path integral. Therefore, only the integral over the curved geodesic 
contributes to the right-hand side of Eq. p06|) . We can evaluate this integral by changing it 
from one in terms of the polar coordinates (r, 6) measured from the center of the disk to an 
integral in terms of coordinates measured from the center (a, b) of the circle on which the 
curved arc lies (see Fig.). If y/ a'^ + b"^ — 1 is the radius of this circle and u the corresponding 
angular coordinate (defined as positive in the counterclockwise direction), then the desired 
coordinate transformation is 



X = a — voM-P— T coscu, (107a) 
y = b-Va^ + b'^-lsmu. (107b) 



After some algebra, we find that 
2 



1-^2 1 - (a2 + 62) _j_ + 52 _ i(acosu; + bsmu) ' 

and 



(108) 



xdy — ydx = — [l — (a^ + 6^) + voM-P— T (a coscj + bsmu)~\du!. (109) 
If we use Eqs. fllOHj) and ()109j) in the integrand of Eq. (jlOHj) . we conclude that 

{xdy — ydx) = —du. (HO) 

Using Eq. (jllOj) in the integral in Eq. pOHj) . we see that the integral in Eq. is zero on 

the straight lines through the origin, while on the other geodesic, it is (to within a sign) the 
angular extent of the geodesic segment about its center. 

If we look at Fig., we see that the two radii of the circle centered at (a, b) together with 
the two sides of the triangle which are straight lines, form a four-sided figure (that is, a 
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Euclidean quadrilateral). Because the sum of the angles in a Euclidean quadrilateral is 
27r,— we have that 

27r = a3+ + +^+ (tt2 + |) (lHa) 
= {ai + a2 + as) + Lu + Tc, (Ulb) 

which implies that 

LU = TT - {ai + a2 + as). (112) 
Therefore, the area enclosed by the triangle is 

Area(E) = - / du = du (113a) 
= TT - (ai + 02 + 03) = -(TWR). (113b) 

Note that because we have proved the area of our special triangle is just the angular sweep 
of its one curved side, it is easy to prove that the area of any geodesic-sided polygon is the 
sum of the angular sweeps of its sides (about their various centers of curvature). 

We have thus proved the main result of this section, that the negative of the Thomas- 
Wigner rotation is equal to both the (rapidity space) area enclosed by the rapidity space 
triangle and the angular defect (vr minus the sum of the interior angles of the rapidity space 
triangle). Although we have derived this result from first principles, it was pointed out by 
Aravind,— and later discussed in a slightly different context by Criado and Alamo.— 

Although the result (jll3b|l is interesting in its own right, it also suggests that another 
way to evaluate and compare Thomas- Wigner rotations is to look at the areas of the corre- 
sponding triangles in rapidity space. Although possible in principle, in practice this is not 
very easy to do because areas in rapidity space depend on where they are located, and thus 
are not readily compared using our Euclidean-trained eyes. More specifically, as Eq. ()49c|) 
for the area element da shows, although two regions in different parts of rapidity space may 
appear to have the same area to our Euclidean-trained eyes, the area of the one closest to 
the edge of the disk is larger. 

C. Various Equations for the Thomas- Wigner Rotation Angle 

We now are in a position to derive the various expressions for the Thomas- Wigner rotation 
angle that have appeared in the literature. Because most of these expressions are for the 
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magnitude of the TWR, we can use Eqs. ()1U3|) and ()113b|) to write 

TWR ^ ^ 



l-r2 
2 

1 — r 
2r2 



xdy — ydx 

^ \{r X df) ■ k\ 
(r X df) ■ k 



.1 — 

If we use Eqs. (jB^ and we find that 

2r2 



(114a) 
(114b) 

(114c) 



1 -r2 



7-1. 



(115) 



Equation ()67|) can then be used to show that for any infinitesimal segment of the path, 



(r X dr) ■ k 




[v X dv) ■ k 






y2 



(116) 

which means that Eq. ()114cj] can be rewritten as 

f , V X dv , , 

|TWR| = ^ (7- 1) — . (117) 

Equation ()117|1 can be re-expressed in various forms. For example, if we call the integrand 
dx, then 

7-1 



V 

y2 



Hence, 



dx ^ 7-1 
dt v^ 



\v X dv\ 
\v X a\ dt 

\v X a\ . 



(118a) 
(118b) 

(119) 



Equation pi8all is the expression for the Thomas- Wigner rotation angle given in Ref. 0, 
p. 289 and Ref . fl, p. 178. 

Several interesting physical properties can be deduced from Eq. ()117|l . First, the right- 
hand side tends to zero in the nonrelativistic limit, showing that in this limit the Thomas- 
Wigner rotation vanishes. Second, as f — * c, the Thomas- Wigner rotation angle increases 
without bound, as we deduced in Sec. IVII Al using the geometry of rapidity space. Third, 
the Thomas- Wigner rotation is a purely kinematic effect because it is independent of the 
dynamics causing the acceleration. In other words, it not only occurs for charged particles 
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moving in electromagnetic fields, but also can occur for elementary particles accelerated by 
nuclear forces, and for masses accelerated by gravitational fields. 

If we multiply the right-hand side of Eq. by (7 + l)/(7 + 1) and use the identity 

given in Eq. (piKjl . we find that 



dx 7 

00 



2 



dt 7 + 1 



(7-1) 



(120) 



which is the expression for the angular speed of the Thomas- Wigner rotation given in Ref. 0, 
p. 290, Ref. Q, P- 554, and Ref. 0, p. 179. Because it is the angular velocity of the Thomas- 
Wigner rotation that enters into the calculation of the Thomas precession, it is this quantity 
that appears in the relativistic correction to the spin-orbit term in the Hamiltonian for a 
hydrogen atom. 

D. Applying the Thomas Precession in Quantum Theory 

Most derivations of the relativistic correction to the spin-orbit term in the Hamiltonian 
for a hydrogen atom relate the time rate of change of the electron's spin vector in its instan- 
taneous rest frame to the corresponding rate in the lab (or proton's rest) frame.-^ Because 
any instantaneous rest frame of the electron is obtained from the previous instantaneous 
rest frame by a non-co linear Lorentz boost, the transformation back to the lab frame will 
include Thomas-Wigner rotations. The rate at which the Thomas- Wigner rotations occur 
is the rate given in Eq. ()119j) . There are several excellent derivations of the correct form of 



the relativistic correction to the spin-orbit term (see, for example, Refs. or Ref. l42l ) , 
and Eq. pi9|l is used in all of them. 

Some authors^ claim the factor of two that comes from including the Thomas precession 
in the spin-orbit term results from the electron's rest frame precessing through one complete 
cycle each time it completes one revolution around the proton. However, as Eqs. ()120|) 
and ()124|1 show, this interpretation is incorrect because the number of rotations completed 
during each revolution is variable, and tends to infinity as f — > c. 

E. A Special Case of Thomas Precession 



The Thomas precession of an object is the sum of all the Thomas-Wigner rotations it 
undergoes when it completes one closed planar orbit. To see this explicitly, consider the 
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simple example of an object moving in a circle with a constant speed. If we use the area 
element given in Eq. (jHH), the expression for the magnitude of the Thomas precession for 
this case is 

10 Jo 



|TP| = / / TT^T^^^^^^ (121a) 



= ^^l (137^^-- (121b) 

We can evaluate this integral by changing the integration variable to m = (1 — r^). After 
some algebra, we find 

|TP|=47rf-^). (122) 



By using Eq. (jll5|) . we see that 



' (123) 



which means that 

|TP| = 27r(7 - 1). (124) 

Note that if the object moves around the circle in the clockwise (negative) direction, then 
the Thomas precession is in the opposite (positive) direction after one revolution around the 
circular path. This result also is derived in Ref. la, p. 179. 



VIII. MATHEMATICAL CONNECTIONS AND ALTERNATIVE EQUATIONS 
FOR THE THOMAS- WIGNER ROTATION 

The purpose of this section is to give a brief discussion of the relation between the results 
presented in this paper and Mobius transformations, spinors, the group 5'L2(C), and models 
of the hyperbolic plane. The only new physical result is Eq. p46p . which expresses the 
Thomas- Wigner rotation angle in terms of the rapidities that give rise to it and the angle 
between their corresponding boosts. Equation p46|) also can be obtained geometrically in 
rapidity space; it is just easier to derive in the present context. 

We have shown that rapidity space and the actions of Lorentz transformations on it 
provide valuable insight into the Thomas- Wigner rotation and the Thomas precession. Al- 
though our presentation has not required an actual algebraic expression for the action of 
Lorentz transformations on rapidity space, it is natural to ask for one. Once we have this 
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expression, it will be easy to relate Lorentz transformations to certain Mobius transforma- 
tions (linear fractional transformations), and then to the spinor map between 5*172 (C) and 
the Lorentz group. 



A. Lorentz Transformations of Rapidity Space 



To see how the Lorentz transformation of Eq. ljlOj) (a boost in the positive x-direction 
with rapidity 0) acts on a point (x, y) in the Poincare disk, let (x, y, (x^ + y'^ + l)/2) denote 
the point on the paraboloid that projects to this point in the disk (see Fig.). If we apply 
the boost to this vector, we obtain 



^ ^ \ / (cosh0)x-sinh0(2^±f±i) ^ 



X 

y 

2 



y 



y 



- (sinh 0) X + cosh ( ^ "'"^ 



(125) 



which we then need to rescale so that it lies on the paraboloid. Some rather messy algebra 
shows the correct scaling factor is 



^ / cosh 0+1 , . , , , cosh ( 
A = I (smh 0)x + 



2 ' 2 

and thus the boost maps points in the Poincare disk by 



-ix' + y^r' 



(126) 







(A 










H 











A((cosh0)x - sinh0( "'+f+^ )) 
Ay 



(127) 



Equation p27j) can be expressed in a surprisingly simple way if we use complex notation to 
denote points in the disk. If we let 2; = x + iy, and set 



cosh + 1 



cosh — 
2 



cosh( 



sinh — , 
2' 



(128) 
(129) 



the action of the boost becomes 



az + b 



z ^ z 



(130) 



hz + a 

Thus, when we use complex notation to label points on the Poincare disk, the action of the 
boost can be expressed in a particularly simple way as a Mobius transformation. 
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Another special type of Lorentz transformation that is easy to analyze is a spatial rotation. 
It is not difficult to see that a counterclockwise spatial rotation by an angle of 6 produces 
the map of the disk 

ie/2 ^ _|_ n 

z^z' = e^'z = ^ \l , (131) 

which is again a Mobius transformation. 

Given any Mobius transformation z t-^ ff^? with a, b, c, d normalized so that ad — bc= 1, 

[a b\ 

we may associate* with it the matrix of determinant 1. The composition of two 

yc dj 

Mobius transformations corresponds to multiplication of the corresponding matrices, and 
an inverse transformation corresponds to the inverse matrix. Thus the boost and rotation 
are associated with matrices 

f cosh I -sinhA g \ 

B.{<P) ={ \ ; , R{e) = , (132) 

\^-sinh| cosh I J y e-'^/y 

which both have the rather special form, 

M{a,P)= f" ^1 , (133) 
\(3 aj 

for complex numbers a and (3 with aa — fifi = 1. Furthermore, for any such matrix M{a, (3) 
with 9i = arg(Q;) + arg(/?) + vr, 6*2 = arg(a) — arg(/?) — vr, and such that cosh | = |a| and 
sinh I = |/?|, we have 

M{a,(3) = R{e,)BMR{G2). (134) 

Thus the matrices arising from boosts and rotations generate all matrices of the form in 
Eq. (im . 

In fact, the Mobius transformations associated with matrices of the form in Eq. ()133|) 
are known to be all the (orientation-preserving) conformal maps of the Poincare disk to 
itself.-^ Because every Lorentz transformation (on (2+l)-dimensional space) must give rise 
to a conformal map of the disk, and every such conformal map arises from a product of 
two rotations and a boost B^., then not only do all the conformal maps arise from Lorentz 
transformations, but also every Lorentz transformation is a product of at most two rotations 
and a boost in the x-direction. 
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B. The Upper-Half Plane Model 

■ A A 

The transformation associated with I I maps the disk conformally onto the set 

\^ l) 

of points z = X + iy with y > 0, and results in the upper half-plane model. The conformal 
transformations of this model are the Mobius transformations corresponding to 2 x 2 real 
matrices of determinant 1, that is, to the group SL2(M.), since 

-1 

(135) 

ranges through SL2{M.) as a, (3 range through all complex numbers with aa — (3(3 = 1. 

Thus Lorentz transformations (on (2+l)-dimensional space) correspond to elements of 
SL2{M(), and the action of a Lorentz transformation on rapidity space is simply the action 
of the corresponding Mobius transformation on the upper half-plane model. 




C. Extension to Three Spatial Dimensions and the Spinor Map 

Although our discussion has been limited to (2+l)-dimensions for ease of exposition, all 
the work carries over in a fairly straightforward way to (3+l)-dimensions (or more). A 
higher dimensional paraboloid within the light cone leads to a conformal model of rapidity 
space, which is now the interior of a unit ball. The metric is given by^ 

ds^ = jz :r-r{dx^ + dy^ + dz^), (136) 

[1 — x"^ — y^ — z'^)^ 

and the geodesies are arcs of circles that intersect the bounding sphere orthogonally. Within 
the ball, the surfaces formed by pieces of spheres centered outside the unit ball which 
intersect the unit sphere orthogonally should be though of as "planar," because geodesies 
remain inside them. Any of these surfaces can be mapped (by a conformal transformation 
of the ball to itself) to a disk bounded by the equator of the ball. The geometry of such a 
disk arising from its embedding in the ball is the same as the geometry developed here for 
the Poincare disk. 

Finally, in addition to the ball model, there is an upper half-space model composed 
of points in where the third coordinate is positive. Although points in it cannot be 
naturally identified by complex numbers — it is after all three-dimensional — they can be 
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identified with certain quaternions x + iy + jz, where z > 0. The (orientation-preserving) 
conformal transformations of this space are identified with matrices in SL2{C), where the 

[a b\ 
matrix acts by— 

yc dj 

X + iy + jz ^ {a{x + iy + jz) + b){c{x + iy + jz) + d)~^. (137) 

The correspondence of Lorentz transformations, which give rise to conformal transformations 
of the model, to elements of SL2{C) is usually called the spinor map.-^ 

D. More Useful Forms of the Thomas- Wigner Rotation 

Because we have identified in the matrix Bx{4>) with a boost of rapidity (p iii the x- 
direction and the matrix R{6) with a spatial rotation through an angle 6 (see Sec. IVIII A|l . 
we can derive a relatively simple equation for the Thomas- Wigner rotation produced by two 
successive, non-colinear, boosts. 

As is easily proved, a pure boost with rapidity (j) in the direction of 6 can be obtained 
by first rotating through —6, then applying an x-boost of 0, and then rotating back by 6. 
Expressing these three operations with matrices, we have 

(cosh I — sinh ^e'^ \ 
— sinh |e cosh | / 

Therefore, as shown in Figs., a boost with a rapidity of (pi in the x-direction, followed by a 
boost of rapidity 02 in the 6 = n — ai direction, corresponds to 

(cosh % — sinh ttC*^ \ / cosh % — sinh ^ \ 
— sinh cosh ^ / \~ ^^^^ ^ cosh ^ / 

On the other hand, any Lorentz transformation in the direction of uji can be expressed 
as the product of a boost R{lji)Bx{4>3)R{—uji) in the ui direction followed by a rotation 
through an angle uj2- Expressing these operations with matrices, we have 

RM (i?(a;i)5,.(03)i?(-^i)) . (140) 

In the specific case shown in Figs and , uj2 is the Thomas- Wigner rotation angle and Ui is 
the angle 0^3. Thus, the product in Eq. ()139|) must equal the product in Eq. ()14U|) which, 
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when expressed in matrix form, is 




3*^2/2 \ / cosh^ -sinh^e*^! 
-*^2/2 I 1 -sinh^e"*'^! cosh^ 



:i4ii 



If we solve for uj2 by equating the upper left entries of Eqs. ()139|) and ()141|) . we find 



UJ2 = 2 arg ( cosh — cosh — + sinh — sinh ) (142a) 
= 2 arg (1 + tanh y tanh y e'^) . (142b) 

Equation ()142b|l is an algebraic formula for the Thomas- Wigner rotation uj^ resulting 
from a boost with rapidity 0i in the x-direction followed by a boost with rapidity 02 in the 
Q = T{ — a\ direction (as shown in Figs, and ??). Note that Eq. (jl42bj) readily produces the 
qualitative results we derived in Sec. I VII Al For example, it shows that the Thomas-Wigner 
rotation will take on values between — vr and vr, and will approach its largest value when 
both velocities are near c and Q is near tt. Equation ()142b|) also shows that the magnitude 
of the Thomas-Wigner rotation is the same regardless of the order in which the boosts 01 
and 02 are applied. 

We end this section by noting that the method used to derive Eq. fll42bj) also can be 
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used to find the rapidity 03 and the angle 03, and to derive the equations given in Ref. 
for tan (a;2/2), cosh 03, and tan 0:3. For example, if we equate the real and imaginary parts 
of the upper left entries of Eqs. ()139|1 and ()14H1 . we find 



cos — cosh — = cosh — cosh — + sinh — sinh — cos 6*, (143) 

2 2 2 2 2 2' ^ ' 

and 

sin — cosh — = sinh — sinh — sin d. (144) 

2 2 2 2 ^ ' 

If we divide Eq. (fTI^ by Eq. dUBI), we obtain 

tan— - sinh (0i/2) sinh (02/2) sing 

2 ~ cosh (0i/2) cosh (02/2) + sinh (0i/2) sinh (02/2) cos g' ^ ' 

which is Eq. (2) in Ref. [2^ If we divide the numerator and denominator of Eq. ()145j) by 
sinh (0i/2) sinh (02/2), we obtain the simpler expressions^ 
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The coefficient D can be written as 



D 



cosh0i/2\ / cosh 02/2 



(147) 



sinh(/)i/2 / \sinh(/)2/2 
e"^! + 1\ fe'^^ + 1\ 
e'^i - 1 y Ve<^2 -ij ' 



(148) 



which, from Eq. (j2DI), is simply a ratio involving Doppler blueshift factors. Alternatively, if 
we use Eqs. dHHl), ^M, and (|9EI) in Eq. (fTTTjl . we see that 



Equation p46|) . together with either Eq. p48|) or ()149|) . is the simplest expression we have 
seen for the Thomas- Wigner rotation angle uj2- 



We have presented a self-contained derivation of a relativistic velocity space called ra- 
pidity space. We then demonstrated how this space can be used to visualize and calculate 
various effects resulting from the successive application of non-colinear Lorentz boosts and 
the relativistic addition of non-colinear velocities. In particular, we showed how rapidity 
space provides a geometric approach to the Thomas- Wigner rotation and the Thomas pre- 
cession, and how it offers both qualitative and quantitative insight into these (and other) 
effects. 
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